Efficient Oblivious Parallel Sorting on the MasPar MP-1

نویسندگان

  • Klaus Brockmann
  • Rolf Wanka
چکیده

We address the problem of sorting a large number N of keys on a MasPar MP-1 parallel SIMD machine of moderate size P where the processing elements (PEs) are interconnected as a toroidal mesh and have 16KB local storage each. We present a comparative study of implementations of the following deterministic oblivious sorting methods: Bitonic Sort, Odd-Even Merge Sort, and FastSort. We successfully use the guarded split&merge operation introduced by Rüb. The experiments and investigations in a simple, parameterized, analytical model show that, with this operation, from a certain ratio N P upwards both OddEven Merge Sort and FastSort become faster on average than the up to the present fastest, sophisticated implementation of Bitonic Sort by Prins. Though it is not as efficient as Odd-Even Merge Sort, FastSort is to our knowledge the first method specially tailored to the mesh architecture that can be, when implemented, competitive on average with a mesh-adaptation of Bitonic Sort for large N P.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

B-Flashsort: A High-performance Sort for the MasPar MP-1 and MP-2 Implementation Description

The sorting of large data sets is a key step in a great variety of applications such as query processing, polygon rendering, statistical analysis and optimization problems. While efficient sorting techniques and implementations have long been available for sequential computers, the problem remains a challenge for parallel computers. One reason is that efficient sequential algorithms take into a...

متن کامل

Efficient Bitonic Sorting of Large Arrays on the MasPar MP-1†

The problem of sorting a collection of values on a mesh-connected distributed-memory SIMD computer using variants of Batcher's Bitonic sort algorithm is considered for the case where the number of values exceeds the number of processors in the machine. In this setting the number of comparisons can be reduced asymptotically if the processors have addressing autonomy (locally indirect addressing)...

متن کامل

Comparison of Direct Particle Simulation on the MasPar MP - 2 and the Connection Machine CM - 2

Particle simulation as applied in the direct simulation Monte Carlo (DSMC) method is a technique for analyzing low density flows and is used extensively for engineering analysis of aerospace vehicles. This work compares two implementations of this method on outwardly similar massively parallel architectures. The MasPar MP-2 and the Connection Machine CM-2 are both massively parallel SIMD archit...

متن کامل

Fast deterministic sorting on large parallel machines

Many sorting algorithms that perform well on uniformly distributed data suffer significant performance degradation on non-random data. Unfortunately many real-world applications require sorting on data that is not uniformly distributed. In this paper, we consider distributions of varying entropies. We describe A-Ranksort, a new sorting algorithm for parallel machines, whose behavior on input di...

متن کامل

A Practical Algorithm for Integer Sorting on a Mesh-connected Computer

This paper presents count sort a parallel algorithmfor mesh connected computers to sort integers where the range of inputs is known A straightforward counting technique that has not been implemented previously in parallel sorting algorithms is presented On a mesh connected computer with p N p N processors we are able to sort N integers in the range p N in time c p N where c is very small For pr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997